Robust pitch detection by narrow band spectrum analysis
نویسندگان
چکیده
This paper proposes a new technique for detecting pitch patterns which is useful for automatic speech recognition, by using a narrow band spectrum analysis. The motivation of this approach is that humans perceive some kind of pitch in whispers where no fundamental frequencies can be observed, while most of the pitch determination algorithm (PDA) fails to detect such perceptual pitch. The narrow band spectrum analysis enable us to find pitch structure distributed locally in frequency domain. Incorporating this technique into PDA’s is realized to applying the technique to the lag window based PDA. Experimental results show that pitch detection performance could be improved by 4% for voiced sounds and 8% for voiceless sounds.
منابع مشابه
Automatic detection of voice creak
The analysis of large spontaneous speech corpora reveals that creaky mode appears more frequently than expected, especially for young female speakers. Creaky mode usually creates fundamental frequency measurement errors and creaky voice segments must be often identified manually beforehand to avoid erroneous reading of F0 in large speech databases. Various approaches have been proposed to ident...
متن کاملSpeech analysis using instantaneous frequency deviation
In this paper, our aim is to derive a phase spectrum representation computed via the short-time Fourier transform. Specifically, we are interested in developing a narrow-band speech representation – employing 20-40 ms analysis windows. Furthermore, this representation should be as physically meaningful as the magnitude spectrum. To achieve these ends, we concentrate on instantaneous frequency (...
متن کاملSpeech Modulation Features for Robust Nonnative Speech Accent Detection
In this paper, we propose to use speech modulation features for robust nonnative accent detection. Modulation spectrum carries long term temporal information of speech and may discriminate accents of native and nonnative speakers. For each speech segment to be tested, we extract a 10 dimension feature vector from modulation spectrum and use it for model training and testing. The proposed modula...
متن کاملRobust Pitch Detection Based on Recurrence Analysis and Empirical Mode Decomposition
A new pitch detection method is designed by the recurrence analysis in this paper, which is combined of Empirical Mode Decomposition (EMD) and Elliptic Filter (EF). The Empirical Mode Decomposition (EMD) of Hilbert-Huang Transform (HHT) is utilized tosolve the problem, and a noisy voice is first filtered on the elliptic band filter. The two Intrinsic Mode Functions (IMF) are synthesized by EMD ...
متن کاملRobust Speech and Bird Song Processing using Multi-band Correlograms and Sparse Representations
of the Dissertation Robust Speech and Bird Song Processing using Multi-band Correlograms and Sparse Representations by Lee Ngee Tan Doctor of Philosophy in Electrical Engineering University of California, Los Angeles, 2014 Professor Abeer Alwan, Chair This dissertation focuses on algorithms for robust speech and bird song processing. Many applications perform well under ideal signal conditions,...
متن کامل